CDS

Accession Number TCMCG022C03382
gbkey CDS
Protein Id XP_010029896.1
Location complement(join(39962089..39962193,39962297..39962575,39962679..39962954,39963107..39963325,39963452..39963620,39963899..39964073,39964229..39964340,39964868..39965101))
Gene LOC104419813
GeneID 104419813
Organism Eucalyptus grandis
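
The Location field above uses the GenBank feature-location syntax: `join(...)` lists the exon segments and `complement(...)` places the feature on the reverse strand. As a minimal sketch (the helper names `parse_location` and `spliced_length` are hypothetical and not part of any database tooling), the segments can be extracted and their lengths summed to check that the spliced CDS length agrees with the protein length given in this record:

```python
import re

# Location string copied verbatim from the record's Location field.
LOCATION = (
    "complement(join(39962089..39962193,39962297..39962575,"
    "39962679..39962954,39963107..39963325,39963452..39963620,"
    "39963899..39964073,39964229..39964340,39964868..39965101))"
)

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple join/complement location.

    Assumption: only plain `start..end` segments occur (no fuzzy `<`/`>` ends,
    no single-base positions), which holds for this record.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def spliced_length(segments):
    """Sum the segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

strand, segments = parse_location(LOCATION)
cds_len = spliced_length(segments)

# 1569 nt = 523 codons = 522 amino acids plus one stop codon.
print(strand, len(segments), cds_len, cds_len // 3 - 1)  # - 8 1569 522
```

The result, 522 residues after removing the stop codon, matches the Length field in the Protein block.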

Protein

Length 522aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA698663
db_source XM_010031594.3
Definition squalene monooxygenase SE1 [Eucalyptus grandis]

EGGNOG-MAPPER Annotation

COG_category I
Description squalene
KEGG_TC -
KEGG_Module -
KEGG_Reaction R02874
KEGG_rclass RC00201
BRITE ko00000
ko00001
ko01000
KEGG_ko ko:K00511
EC 1.14.14.17
KEGG_Pathway ko00100
ko00909
ko01100
ko01110
ko01130
map00100
map00909
map01100
map01110
map01130
GOs -

Sequence

CDS:  
ATGGACGGTCAGTACTTGGTCAGTGGCGTCTTGGCTCTGTTCCTGGGGATCTTCCTGCTGTACAAGGGGCTCGGGAAGCAGAAGAGGAGGCTGTCCAAGAAGGGTCGCGGCGATGACTATGTGAAGAGCTCTGTGGATGGAGGTTTCGTGCCCGGCGGCGTCGATGGGAGCACCGACATCGTCATTGTCGGAGCAGGCGTCGCCGGTGCGGCTCTCGCTTACGCGCTCGGGAAGGATGGACGTCGCGTGCGTGTAATTGAGAGGGACCTGACGGAGCAAGATAGAATTGTCGGCGAGCTTCTTCAACCAGGAGGTTACCTGAAATTGATGGAATTGGACCTTGCAGATTGCGTGCAAACAATTGATGCCCAAAGAGTTGTCGGATATGCTCTTTTCAAGGATGGGAAACACACGCAGTTGTCTTATCCATTGGAGAATTTTCATTCGGATGTCGCCGGCAGAGGTTTTCACAACGGCCGTTTCGTTCAAAGCATGAGGGAAAAGGCCGCAACTCTTCCAAACGTAAGTCTAGAACAAGGGACAGTAACATCTCTAATTGAGGAAAAGGGAACTGTCAAGGGAGTGCAATACAAGACCAAGGCCGGGGAAGAGTTGAAAGCATATGCTCCCCTCACCATCGTATGTGACGGTTGCTTTTCAAACCTACGCCGTAACCTCTGCTTTCCGAAGGTTGATGTCCCCTCTCATTTCGTGGGGTTGGTCGTGGAGAATTGTGATCTTCCATTCCCAAATCACGGCCATGTCATACTGGCAGACCCTTCGCCTATCTTATTTTATCCGATCAGCAGCACTGAGATCCGTTGTCTGGTCGATGTCCCTGGCCAGAAATTGCCCTCTTTAGCCAGCGGTGAAATGGCCACATATTTGAAGACAAAGGTTGCTCCCCAGGTTCCCCCTCAATTGTACAAAGCCTTCATCGCAGCAATTGACAAGGGAAACATCAAGTCGATGCCAAATAGAAGCATGCCTGCCAATCCTCAACCCACCCCTGGAGCTCTTCTGATGGGAGACGCGTTCAACATGCGCCATCCATTGACAGGAGGAGGAATGACCGTGGCTCTTTCTGATATCGTTTTGCTAAGGAACCTCCTTCGCCCACTTCAGGATCTGAATGATGCATCTGCTCTATGCAAATATCTCGAGTCGTTCTATACACTGAGGAAGCCTGTGGCGTCGACCATCAACACCCTGGCTGGTGCTCTGTACAAGGTCTTCTGTGCATCTCCAGACCCGGCAAGAAAGGAAATGCGCCAGGCATGCTTCGACTATCTGAGCCTTGGCGGTCTCTGCTCAACTGGGCCAGTCTCTCTGCTCTCGGGTCTAAACCCCCGTCCAATGCACTTGGTCTGCCATTTCTTTGCAGTAGCAGTATATGGTGTCGGGCGGCTATGTCTTCCATTCCCTTCGCCGAAACGCATGTGGCTCGGGGCCAGACTGGTTAAGGGTGCATCAGGTATCATCTTTCCCATAATAAGGGATGAAGGAGTAAGGCAGATGTTCTTCCCTGCGACTGTGCCGGCTTACCACAGAGCTCCTCCTGTTCACTGA
Protein:  
MDGQYLVSGVLALFLGIFLLYKGLGKQKRRLSKKGRGDDYVKSSVDGGFVPGGVDGSTDIVIVGAGVAGAALAYALGKDGRRVRVIERDLTEQDRIVGELLQPGGYLKLMELDLADCVQTIDAQRVVGYALFKDGKHTQLSYPLENFHSDVAGRGFHNGRFVQSMREKAATLPNVSLEQGTVTSLIEEKGTVKGVQYKTKAGEELKAYAPLTIVCDGCFSNLRRNLCFPKVDVPSHFVGLVVENCDLPFPNHGHVILADPSPILFYPISSTEIRCLVDVPGQKLPSLASGEMATYLKTKVAPQVPPQLYKAFIAAIDKGNIKSMPNRSMPANPQPTPGALLMGDAFNMRHPLTGGGMTVALSDIVLLRNLLRPLQDLNDASALCKYLESFYTLRKPVASTINTLAGALYKVFCASPDPARKEMRQACFDYLSLGGLCSTGPVSLLSGLNPRPMHLVCHFFAVAVYGVGRLCLPFPSPKRMWLGARLVKGASGIIFPIIRDEGVRQMFFPATVPAYHRAPPVH
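
The CDS and Protein entries above are related by translation with the standard genetic code. The following is a minimal sketch of that step, assuming the standard code (NCBI translation table 1) and an in-frame sequence; the `translate` function is a hypothetical helper written for illustration, not the pipeline that produced this record. It is applied here only to the first eight codons and the last four codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order so they line up with the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS above.
print(translate("ATGGACGGTCAGTACTTGGTCAGT"))  # MDGQYLVS
# Last four codons of the CDS above, including the TGA stop.
print(translate("CCTGTTCACTGA"))              # PVH
```

Both fragments agree with the start (`MDGQYLVS...`) and end (`...PVH`) of the Protein sequence in this record.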